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In  the  traditional  literature  concerning  A71,  2-3,  bounded 
balance  or  multiway  3-trees,  it  has  been  assumed  that  a 
pointer-changing  operation  would  require  approximately  one 
unit  of  runtime.  This  anr nomination  is  inane li cable  to  the 


augmented  trees  of  3S77,  1u78,  lw78b,  V/iTSa  and  V.’i76c,  because 
these  trees  associate  an  auxiliary  data  structure  to  each 
interior  node,  which  complicates  the  cost  of  pointer-changing 
operations.  Such  an  operation  will  consume  1  +  Jw  units  of 
runtime  in  an  augmented  tree  application  (where 'J  demotes  the 
number  of  records  that  gain  a  new  ancestor  as  a  result  of  the 
pointer-changing  operation,  and  w  is  a  coefficient  depending 
on  the  particular  application). 


The  purpose  of  this  paper  will  be  to  pr.tose  an  algorithm 
which  is  a  generalization  of  traditional  2--:rees  designed  to 
possess  an  C(v;log  IT)  worst-case  insertion  and  deletion  runtime 
in  an  augmented  tree  environment ,  This  algorithm  is  signifienr. 
because  it  has  the  optimal  runtime  order  of  magnitude. 


The  Suner-5-Trse  Algorithm 
By  Dan  E.  7/illard 


During  the  last  year  three  papers  (BS77,  Lu78.  '.7178a)  have 
independently  proposed  the  use  of  a  new  data  structure  whose 
most  ger.era.1  fora  was  called  an  augmented  tree  in  7,'i78c.  .-jo 
augmented  tree  is  defined  as  a  tree  representation  of  a  sorted 
list  in  which  ever:*  interior  node  contains  a  new  field  of  infor¬ 
mation  which  Y/i78a  calls  a  "subtree  description  structure  (313)." 
In  the  content  of  an  interior  node  v,  the  tern  "SIS"  refers  to 
any  arhitrarr*  user-defined  data  structure  that  describes  v's 


descendants.  If  T  is  a  tree  sorted  by  E2Y.1,  then  one  errands 


of  an  3DS(v)  field  is  a  sorted  list  that  enumerates  v's  descen¬ 
dants  by  order  of  increasing  KEY. 2  value.  Another  example  is  a 
partial  natch  data  structure  representation  (3e75 ,  ?.i76.  7.'i7Sb, 
Y/i78c)  of  v's  descendants.  The  definition  of  SDS  fields  in 
7/i?8a  was  carefully  worded  to  er.conpass  all  future  possibilities 
by  indicating  that  5DS(v)  nay  be  any  data  structure  descricir.- 
v's  descendants. 


Research  into  the  retrieval  capacities  of  augmented  trees 
only  seriously  began  during  the  last  year.  2577,  Iu73,  1.773b 
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and  Y/i78a  verified  that  searches  for  the  records  satisfying  a 
k-dimer.si onal  query  of  the  font 

ax<  EZY.1<  h-L  &  &2<  SEY.2<  eb  ...  ak<  EEY.k<  \ 

can  be  performed  in  C(log^  21)  tine  with  a  special  augmented  tree 
that  7/i78c  calls  a  ?0(k)  pyramid.  Uore  efficient  augmented  trees 
we  re  also  discussed  in  7/i78c.  The  article  displays  an  improved 
data  structure  that  enables  k-dinensi onal  queries  to  be  performed 

in  0(lcg  IT)  time  without  any  serious  accompanying  disadvantage . 

In  the  special  case  where  k  =  2,  this  result  reduces  to  a  data 

structure  that  occupies  C(I!log  IT)  space  and  has  the  same  C(lcg  IT) 
worst-case  retrieval  time  as  one -dimensional  sorted  lists.  There 
are  also  many  additional  retrieval  theorems  about  augmented  trees 
mentioned  in  7/i78a  and  '.7i78c. 

The  purpose  of  this  paper  will  be  to  explain  how  records 
can  be  efficiently  inserted  into  and  deleted  from  augmented 
trees.  Our  algorithm,  called  the  super-2 -tree  procedure,  will 
be  a  generalization  of  traditional  B-tree  procedures  which  is 
specifically  designed  for  the  case  of  augmented  trees.  Zee 
runtime  of  this  procedure  can  best  be  explained  by  letting  w 
denote  the  amount  of  runtime  needed  for  a  record  to  be  inserted 
into  or  deleted  from  an  SDS  field  (in  a  particular  application) . 
Under  the  3uper-3-tree  algorithm,  any  record  can  be  inserted 
into  or  deleted  from  an  augmented  tree  in  C(wlog  IT)  v.orst-case 
runtime . 

The  surer-3-tree  a roc e dure  will  be  cuite  different  from  its 


traditional  2-tree  counterparts  (such  as  the  AVI,  bounded-balance . 


i 

a- 
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2-3,  and  multiway  algorithms).  To  understand  the  reason  for  this 
let  J  denote  the  number  of  records  that  gain  or  lose  an  ancestor 
as  a  result  of  a  pointer-changing  operation.  Such  operations  wil 
clearly  require  0(1  +Jw)  runtime  in  the  context  of  an  augmented 
tree.  Consideration  of  the  degenerate  case  where  indicates 

that  all  traditional  1-tree  algorithms  must  have  an  C(v;IJ)  worst-c 
insertion  and  deletion  runtime  when  mani tula tin ~  augmented  trees. 
The  super-B-tree  algorithm  will  thus  require  several  further 
optimizations  to  attain  its  0(wlog  N)  worst-case  runtime. 

In  addition  to  considering  worst-case  runtime,  this  paper 
vh.ll  also  examine  a  weaker  method  of  measuring  runtime  called 
CERT.  The  definition  of  CERT,  introduced  in  '.7i76a,  requires  that 

i)  the  symbol  A  denote  an  algorithm  which  performs  insertion 
and  deletion  operations  in  a  data  structure  denoted  as  1 

ii)  C  denote  a  sequence  of  insertion  and  deletion  commands 
whose  length  is  denoted  as  I  C  ) 

iii)  data  structure  2  represent  the  emnty  set  of  records  csfor 


command  sequence  C  i_s  executed 

Under  these  circumstances,  algorithm  A  will  be  said  to  have  a  CZR 
(the  acronym  stands  for  "Conservative  Estimate  of  Runtime" ;  equal 
to  R  iff  C  |  H  |  represents  the  me.ximum  amount  of  time  that  any 
sequence  C  can  force  A  to  consume . 

An  article  giving  a  preliminary  explanation  of  the  CZZT- 
measured  cost  of  insertion  and  deletion  records  in  augmented 
trees  was  Lu7 8.  The  discussion  in  Lu78  centered  around  a  data 
structure  which  was  called  a  ?q(1:)  pyramid  in  ":i7Sc.  ?^(k) 

pyramids  are  inductively  d 


efined  a3  follows: 
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i)  Pq(1)  is  defined  zo  be  a  standard  tree  whose  records  are 
sorted  by  increasing  value  in  their  first  key 
ii)  Fg(k)  pyramids  are  augmented  trees  whose  records  are 
sorted  by  their  k-th  key  and  whose  SDS  fields  are 
Pg(k  -1)  pyramids 

Lu78  displayed  an  algorithm  that  inserted  and  deleted  records  in 
Pg(k)  pyramids  in  O(log^  I'T)  CSP.T  time.  The  discussion  in  Lu78 
did  not  consider  general  augmented  trees  of  arbitrary  complexity, 
and  its  main  algorithm  would  require  significant  revision  tc 
process  several  augmented  trees  in  C(v.*log  IT)  C1P.T  time.  Also 
Iu78  did  not  optimize  ;vorst-case  runtime. 

Pive  months  prior  to  the  presentation  of  Iu73,  I  submitted 
a  dissertation  ~o  the  Harvard  L'athematics  Department  ( ’.7i 7 ca ) 
whose  subject  matter  included  an  independent  derivation  of  thi3 
material,  as  -.veil  as  a  large  number  of  additional  and  more 
powerful  algorithms.  It  was  understandable  that  lueker ’ s 
presentation  did  net  contain  a  reference  to  the  more  powerful 
theorems  of  V/i78a,  as  the  Lu78  presentation  was  given  only 
shortly  thereafter. 

The  main  purpose  of  V/i78a  was  to  study  retrieval  in  zhe 
context  of  the  3et  of  records  satisfying  arbitrarily  complex 
predicates,  and  to  show  how  retrieval  —  as  well  as  the  evaluation 
of  the  set' 3  sums,  counts,  multiplicative  product,  universal 
cyuantifiers.  existential  quantifiers,  mean  values,  median  values, 
maximum  values,  and  minimum  values  —  can  be  performed  in 
OClog4*  IT)  or  less  worst-case  runtime  for  virtually  all 
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commercial  user  requests.  Cne  of  the  sever,  chapters  of  '.Vi 7 
discussed  an  algorithm,  that  was  substantially  acre  efficient 
than  the  somewhat  similar  update  procedure  which  later  appeared 
in  Lu78.  That  chapter  consisted  of: 

i)  a  demonstration  that  all  traditional  3-tree  algorithms 
require  C(wJI)  CEET  time  when  applied  to  augmented  urees 

ii)  a  description  of  a  procedure  fundamentally  different 

from  Lu?8  that  for  arbitrary  augmented  mrees  can  insert 
or  delete  a  record  in  C(wlog  IT)  CZ3T -measured  tine 

iii)  an  explanation  and  formal  proof  illustrating  how  the 

above  algorithm  can  be  improved  to  develop  a  procedure 
with  a  strict  O(wlog  N)  worst-case  tine 
A  similar  three-part  mode  of  presentation  vail  be  used  here. 

The  subject  matter  has  been  divided  so  that  topics  i)  and  ii) 
are  discussed  in  section  1,  and  topic  iii)  in  sections  2  and  3. 

All  propositions  will  be  given  two  theorem  numbers  in  this 
paper.  The  first  number  indicates  the  chronological  arrangement 
of  the  propositions,  and  the  second  indicates  where  the  sans 
proposition  can  be  found  in  V/i7Sa. 


(ft 
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PARI  1 

The  nature  of  the  challenge  faced  in  this  paper  car.  be 
understood  if  some  traditional  3-trees  (AVI,  2-3 t  and  multiway ) 
are  examined  in  detail.  Theorem  1  demonstrates  that  these 
algorithms  require  O(wN)  CERT  runtime  when  the  y  manipulate 
augmented  trees.  This  CERT  runtime  indicates  that  these  algorithms 
are  substantially  more  difficult  to  optimize  than  was  suggested 
by  the  simple  O(wlT)  worst-case  update  tine  mentioned  at  the 
beginning  of  this  paper. 

The  proof  of  Theorem  1  assumes  that  the  reader  is  familiar 
with  AVL,  2-3  and  multiway  trees,  at  least  to  the  point  of  Zn-72 
and  AKU-74.  Readers  not  familiar  with  these  references  may  omit 
this  proof  and  still  understand  most  of  the  rest  of  oh is  paper. 


Theorem  1: 


Application  of  the  AVI,  2-3 ♦  and  multiway  procedures 


to  augmented  trees  will  produce  an  algorithm  with  an  C ( w.E ,  CERT 
runtime  (Th  3.2.0). 


Proof  for  AVL  trees*. 


A  binary  tree  of  height  h  will  be  said 


to  have  naninal  size  if  it  contains  precisely  2'*  leaves.  let 
T^,  T3  three  such  mamimal-sized  trees  of  height  h 

Let  T  denote  that  AVL  tree  which  is  shown  in  the  diagram  below: 


, 
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Let  c-^  Cg  denote  a  sequence  cf  four  data-modificatior. 

commands  such  that 

i)  Comar d  c-.  causes  a  leaf  to  he  added  to  subtree  T, 

ii)  Command  C2  renoves  the  cited  leaf 

iii)  Cczaand  c^  causes  a  leaf  to  be  added  to  subtree 

iv)  Command  c^  removes  the  cited  leaf 

It  is  easy  to  verify  that  the  previous  four  commands  will  cause 
the  AVL  algorithm  to  move  T2  from  the  left  side  of  tree  1!  to 
the  right  side  and  then  subsequently  bach  to  the  initial  position. 
Note  that  this  cycle  of  four  commands  will  require  C(v;IT)  time  to 
manipulate  the  S2S  fields,  furthermore,  this  cycle  can  be  repeated 
any  number  of  times.  Each  repetition  will  thus  consume  an  addi¬ 
tional  O(wN)  time.  At  least  w.T2  runtime  must  therefore  be  consumed 
by  a  sequence  of  57,  commands  that  first  builds  the  initial  tree 
and  then  executes  IT  repetitions  of  this  circle.  Hence  the  A71 
algorithm  will  have  at  least  an  0(v/IT)  CEE?  runtime  when  it 
manipulates  augmented  trees.  QED 


Proof  for  the  case  of  2-3  trees : 


Consider  a  2-3  tree  -where 


every  ancestor  of  leaf  L  has  3  sens.  Consider  a  sequence  cf 
two  commands  where 

i)  Command  c-,  causes  L  to  gain  a  new  brother 
ii)  Command  Cg  removes  the  cited  record 

Note  that  these  two  commands  will  force  the  2-3  algorithm  to 

consume  at  least  C(wN)  time.  Also,  note  that  the  2-3  tree  will 

be  in  the  same  state  after  the  execution  cf  these  two  commands 

as  it  was  before.  Each  repetition  of  a  cycle  of  these  two 


commands  will  thus  consume  an  additional  C(wN)  time.  Hence, 
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(1)  k>2 

(2)  Ct  £l  -  '{l^  1/k 

Alg(o*  ,  k)  -.Till  use  a  two-step  procedure  for  peri  erring-  insertion 


and  deletion  operations.  The  first  step  will  modify  the  tree  so 


that  the  user's  specified  record  is  either  inserted  into  or 


deleted  fron  the  tree  in  the  straightforward  manner  suggested 


by  this  command.  The  second  step  rn.ll  scan  the  rodified  tree 


to  determine  whether  any  node's  p(v)  ratio  was  caused  to  ereeed 
the  (o1  ,  1  -c^)  interval  by  the  first  step.  Tor  every  node 


violating  this  range,  the  second  seep  vail  utilize  one  of  two 
"rebalancing"  procedures  to  force  p(v)  bach  into  the  (ct ,  1  -  ; 


interval.  The  more  subtle  of  these  procedures,  called  RCTAT2(c^  ,  1:), 


is  defined  as  executing 

A)  the  single  rotation  of  Diagram  1  when  p(v)*;oc  rid 


1  - 


3 )  the  double  rotation  of  Diagram  2  -when  p(v)^c<  and 

?(vH)  >  r'-Ji* ' 

C)  the  nirror  image  of  the  ^receding  rotations  when 


p(v)  ^  1  -  cA 

rhe  above  ?.CTAT2(c^,  h)  subroutine  trill  be  used  by  Alg(0^  ,  k! 


to  rebalance  all  nodes  satisfying  (since  under  these 


circumstances  ?.CTAT2(o^  •  h )  necessarily  forces  the  p-ratios  of 
all  affected  nodes  bach  into  the  (c^,  1  )  rar.pe } .  A  different 

module  will  be  utilized  by  Alg(c^  ,  h)  to  rebalance  nodes  whose 


:'v<  * 


.p 

because  of  their  less  stable  nature. 


m  tms  case, 


the  trivial  but  inefficient  brute  force  method  will  be  milled 

* 


1 


'jUvi*1 


t- 


DOUBLE  ROT ATI Or 
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to  ensure  that  the  entire  v-rooted  subtree  has  p-ratics  tel  or.. ping 
to  the  (o^  ,  1  -cA  )  range.  She  inefficiency  of  the  brute  force 
method,  or  any  other  details  pertaining  to  it,  are  unimportant 
since  its  restricted  application  to  nodes  vri.il  snail  IT  values 
ensures  that  it  cannot  affect  the  runtine  magnitude  of  Alg( cA  ,  k). 

The  AlgCd-  ,  k )  procedure  described  above  is  a  miner 
generalization  of  a  similar  algorithm  that  was  originally 
proposed  by  I?i  ever  gelt  and  Reingold  in  ITR73.  Their  algorithm 
was  for  the  most  part  the  special  version  of  Alg(c^,  k)  resulting 
v/hen  k  =  2.  We  say  "for  the  most  part"  because  the  ITR73  algorithm 
contained  a  minor  error  of  omission  by  nov  including  the  brute 


force  module.* 


Tne  significance 


of  ITievergelt  and  Reingold's  work  was 


they  demonstrated  that  A2g(cA,  k)  possessed  the  standard  C(lcg 
insertion,  deletion  and  search  times  associated  vrith  3-trees: 
and  that  it  ensures  that  no  insertion  or  deletion  can  cause  any 
initial  3B(c A)  tree  to  violate  this  condition.  It  is  trivial  t 
generalize  the  ITR-73  results  for  any  cf*  and  k  parameters 
satisfying  equations  1  and  2  (only  the  runtime  coefficient  of 
the  Alg(c^,  k)  procedure  changes  as  one  varies  its  parameters 
within  the  permitted  range). 

Two  theorems  will  be  proven  about  the  Alg(c^,  k)  procedure 
in  this  section.  The  first  will  indicate  that  the  origir. ally- 
proposed  Alg(dl,  2)  procedure  has  an  C(w;T )  CERT  runtime  when 
applied  to  augmented  trees,  and  the  second  that  an  C(wlog  IT; 


o 


CERT  can  be  attained  if  I:  is  set  greater  than  2. 


*  To  understand  wh;/  thr  s  module 
of  p(v.  5  in  3ia~ram  1  when  IT,. 

j-.  *  —  j 


i 


s  necessary 
and  cA  are 


calculate  ric 
small. 


:lue 


•  v 
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Theorem  2: 


Application  of  the  A2g(c^  ,  2)  procedure  to  augmented 


trees  produces  an  algorithm  with  an  OCwTT)  CERT  (Th  5. 2. IT,'. 


Proof: 


Let  T  denote  a  B2(c^}  tree  which,  contains  the  three 


subtrees  T-^,  T2  ^3  as  sbovm  in  the  diagram  below: 


xl\ 


/  T2\ 


X  T 


Let  us  further  assume  that  these  three  subtrees  contain  respective! 


cf'  IT,  (1  -  2  c £}IT,  and  o1  IT  leaves.  Let  c-,  c2 
sequence  of  four  commands  such  that 


: ,  cencte  a 


1)  Command  inserts  a  leaf  into  the  T^  subtree 

2)  Command  deletes  the  same  leaf 

3)  Command  inserts  a  leaf  into  the  subtree 

4)  Command  ct  deletes  the  same  leaf 

It  is  easy  to  verify  that  the  previous  four  commands  will  cause 
AlsCoL,  2)  to  move  the  T9  subtree  from  the  right  side  of  node  v 


to  its  left  side  and  then  bach  to  the  initial  position,  lets 
that  these  movements  will  force  Alg(c^  ,  2)  to  consume  C ( *.vI7 >  time 
(because  the  SLS  structures  must  be  adjusted  whenever  T2  moves). 
Also  note  that  the  cycle  of  the  previous  four  commands  car.  be 
repeated  any  number  of  times.  Bach  repetition  will  consume 
0(wN)  additional  time.  Hence  Alg(a^»  2}  will  have  mo  C(wIT) 

CERT  runtime  for  the  same  reasons  that  the  AVL  algorithm  had 


this  runtime. 
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The  remainder  of  this  section  vri.ll  be  devoted  ~o  proving 
that  the  Alg(c/,  k)  generalization  of  the  bounded  balance 
algorithm  ’.Till  manipulate  augmented  trees  in  O(v;log  IT)  CDP.T 
runtime  if  k  is  chosen  greater  than  2.  In  this  discussion, 
we  will  speak  of  a  tree's  collective  depth.  This  will  be 
defined  as  the  sum  of  the  depths  of  its  leaves. 


Lemma  3«1:  Each  application  of  a  single  or  double  rotation 

to  a  node  v  by  the  Alg(o**»  k)  procedure  will  cause  the  collective 
leaf  depth  to  decrease  by  at  least  (k  -  2)  oC  I!_. 


Proof:  Similar  methods  are  used  to  verify  the  theorem  fc 

cases  of  the  single  and  double  rotations.  It  is  therefore  s 


1- 


cient  to  consider  the  single  rotation  of  Diagram  1. 

The  rules  of  the  Alg(<A,  k)  procedure  imply  that  this 
rotation  is  executed  only  when 

i)  there  are  less  than  cAlTv  descendants  in  the  T-^  subtree 
ii)  there  are  more  than  (k  -  1)  C>IT„  descendants  in  the  I, 

V  j 

subtree 


Also  it  is  apparent  that  the  single  rotation  of  Diagram  1  causes 
iii)  the  depth  of  all  records:  in  the  T^  subtree  to  increase  by 
iv)  the  deoth  of  all  records  in  the  T,  subtree  to  decrease  by 
The  collective  implication  of  i)  through  iv)  is  that  this  rotation 
will  produce  the  predicted  loss  of  leaf  depth. 
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Lemma  3.2:  If  k>2,  then  the  amount  of  tine  that  the  Alg(o< 

procedure  'Till  need  to  make  such  adjustments  in  the  SDS  fields 
will  always  be  proportional  to  the  amount  of  leaf -depth  lost 
during  these  rebalancing  transformations.  more  specifically, 
the  ratio  of  tine  spent  adjusting  the  SDS  fields  over  loss  of 
leaf -depth  (during  rotations  )  will  always  be  less  than  or 
equal  to  (?h  3.2.R). 

Proof :  Hote  that  the  Alg(c^,  k)  procedure  requires  n c  more 

than  v/IIy  runtime  to  adjust  the  SDS  fields  when  it  performs  a 
rotation  and  that  the  previous  lemma  indicated  a  lower  bound 
on  the  associated  loss  of  leaf-depth.  The  present  lemma  is  a 
direct  consequence  of  these  observations.  QED 

Theorem  3 :  If  k  ^  2,  then  the  Alg(c4 ,  k)  procedure  will  hav 

an  O(\vlog  IT)  CERT  runtime  when  it  manipulates  augmented  trees 
(Th  3.2.3). 

Proof :  ITote  that  the  only  part  of  the  Alg(c*.  ,  k)  procedure 

which  is  capable  of  taking  more  than  O(wlog  IT)  time  is  its 
ROT ATS (d  1  k)  step.  Also  note  that  the  previous  lemma  indicated 
that  the  runtime  of  this  step  is  proportional  to  the  loss  of 
leaf-depth.  The  theorem  will  thus  be  proven  if  sufficient 
bounds  are  verified  on  the  size  of  the  collective  leaf-depth 
of  23(c^)  trees.  Such  bounds  follow  from  the  observation  that 
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if  II  denotes  the  maximum  size  of  tree  T  during  command  sequence 
c,  then  the  nonrotating  aspects  of  these  connands  cannot  increase 
the  collective  leaf-depth  of  T  by  more  than  C( J  C  j  log  II;. 


QED 


Comment  1:  It  is  interesting  to  re-examine  ITievergelt  and 

Reingold's  original  Alg(^-,  2)  procedure  in  light  of  the  previous 
two  propositions.  Rote  that  Theorem  3  holds  only  when  1:  >  2.  and 
it  therefore  did  not  preclude  Alg(oi  ,  2)  from  having  an  0(7/17) 

C3RT  runtime.  In  the  context  of  augmented  trees,  Alg(ot>  ,  lc) 
is  thus  considerably  more  efficient  than  Alg(C^  ,  2). 

Comment  2:  Also,  it  is  interesting  to  compare  A2g(ot ,  i) 

to  the  slightly  different  procedure  proposed  in  Lu?8.  Per 
PqCIc)  pyramids,  lueker '  s  algorithm  vn.ll  have  an  C(wlog  77)  C2RT. 
However,  it  dees  not  generalize  easily  to  other  types  of 
augmented  trees  and  must  have  an  0(wlog2  77)  C2RT  for  certain 
types  of  trees. 


J 
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PART  2 


The  next  tv/o  sections  of  this  paper  vri.ll  be  devoted  to 
explaining  hov;  the  C(v/log  IT)  GIRT  of  Alg  ( cL  ,  k)  car.  be  convert* 
into  a  strict  worst-case  runtime  vri.th  a  s  one  what  nore  sophistic; 
procedure  called  Alg  (ct,  i3,  E,  J).  An  intuitive  description 
of  Alg  (oU  X,  J)  will  be  given  in  this  section  with  a  nore 
detailed  discussion  in  the  next  section. 

To  understand  the  intuition  behind  this  procedure,  it  shoul 
be  recalled  that  the  O(wlog  27)  CERT  run  tine  cf  Alg  ( oL ,  inpl: 
that  ever;r  sequence  C  of  insertion  and  deletion  commands  nust 
consume  less  than  (cj  leg  27  runtir.e  (when  applied  to  an  initial: 
empty  tree).  The  desired  0(v;log  IT)  worst-case  runtime  will  thu; 
be  achieved  if  the  sequence  is  stabilised  so  that  every  one  of 
its  commands  consumes  no  more  than  the  sequence's  approximate 


average  runtime.  Alg  (c*,  B,  Z,  J)  will  attain  its  C(v;lcg  IT) 
worst-case  runtime  by  performing  this  optimisation. 

Llore  specifically,  Alg  {d  ,  B,  E,  J)  will  differ  from 
Alg  (ol>  k)  by  precluding  the  possibility  that  C(wIT)  worst-case 
runtime  is  spent  adjusting  the  3ES  fields  when  the  user  gives  a 
command  that  causes  a  rotation.  Instead,  it  will  invoke  an 
evolutionary-  process  for  gradually  building  the  new  SDS  field 
while  the  user  gives  a  large  number  of  commands.  The  purpose 
of  this  method  will  be  to  insure  the  achievement  of  C( wlcg  IT, 
worst-case  runtime. 


In  cur  formal  discussion,  reference 
fields  which  are  "martially  constructed" 


will  be  made  to  Ill 
as  opposed  mo  "full;/ 
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constructed."  3 y  "partially  constructed"  we  near  an  SDS  f: 
that  has  not  yet  been  completely  built  by  the  evolutionary 
process.  An  SDS  field  which  is  not  partially  constructed 
be  called  "fully  constructed."  Also,  the  terns  "current"  : 
"anticipated"  nodes  -.Till  be  used.  The  forner  refers  to  an 
interior  node  presently  existing  in  an  augmented  tree,  and 
the  latter  to  a  new  node  that  a  special  nodule  of  the 
Alg  (c^  ,  Z,  J)  procedure  "anticipates"  will  be  inserted 
into  this  tree  after  a  forth coning  single  or  double  rotati: 
For  instance,  if  Alg  (  c* ,  8,  Z,  J)  anticipates  the  futzire 
execution  of  the  double  rotation  shown  in  Diagram  2.  then  • 
nodes  v-  and  v^  would  be  said  to  be  "anticipated."  In  rern 

■D  w  ~ 

the  symbol  A?7TICIF(v,  ?.)  will  denote  those  nodes  that 
Alg  (c 0,  Z,  J)  anticipates  will  be  inserted  after  the 
application  of  a  rotation  E  to  a  node  v. 

The  Alg  (ji,  0,  K,  J)  procedure  will  be  designed  to  i: 
that  all  its  current  interior  nodes  have  fully  constructed 
fields  (since  it  would  otherwise  mahe  very  little  sense  to 
employ  augmented  trees).  Its  anticipated  SDS  fields  will 
typically  be  partially  constructed.  The  subtle  component 
of  Alg  ( o( ,  8,  K,  J)  is  the  evolutionary  process  that  effic 


converts  the  initial  "partially  constructed"  SDS  fields  of 
"anticipated  nodes"  into  the  final  "fully  constructed"  SDS 
fields  of  "current  nodes."  huch  of  the  essence  of  this 
evolutionary  process  can  be  explained  by  examining  the  J 
parameter  cf  Alg  {cL  ,  fi,  Z,  J).  For  constant  J  whose  vslus 


Leld 

■Till 
znd 


i  sure 
SDS 
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will  be  defined  later,  Alg  Q,  Z,  J)  will 

i)  anticipate  the  application  of  a  rotation  to  node  v  at 
an  early  enough  tine  to  ensure  that  at  least 
insertion  and  deletion  connands  are  applied  to  the 
v-rooted  subtree  between  the  tine  of  anticipation 
and  the  tine  when  the  rotation  is  performed 
ii)  gradually  construct  the  SDS  fields  required  by  this 

rotation  during  this  intervening  period  in  a  manner  that 

a)  expends  Jm  runtime  building  each  SDS  field  of 
ANSICIPCv,  R)  on  every  occasion  when  an  insertion 
or  deletion  command  is  applied  to  one  of  v's 
descendants 

b)  ensures  that  all  the  SD3  fields  of  AI!lICIR(v,  ?.; 
have  become  fully  constructed  before  that  time 
when  Alg  (o/  ,  3,  K,  J)  applies  rotation  R  to  node  v 

Cnee  again  it  should  be  repeated  that  the  virtue  of  the  above 
process  is  that  it  makes  possible  an  C(wlo~  IT)  worst -case  runtime 
in  the  context  of  a  tree  whose  current  5PS  fields  are  always 
fully  constructed.  A  more  formal  description  of  Alg  ( cx .  J) 

will  be  given  in  the  next  section. 
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data  structure  (and  also  obviously  incrementing  the  value  of 
TZh?(v)  to  reflect  these  insertions}. 

It  is  straightforward  to  develop  an  implementation  of  the 
GCAS  procedure  that  requires  G(Jw)  run  tine  upon  each  ir.vocatic 
(because  in  this  runtine  the  procedure  can  walk  J  steps  down  a 
linear  list  of  records  and  e upend  v  runtine  inserting  each 
encottr.tered  record  into  SD3(v)).  A  formal  description  of  C-CA3 
can  be  found  in  part  3.4.E  of  ;7i78a.  In  the  interests  of 
brevity,  this  basically  trivial  subroutine  will  not  be 
described  here. 

Another  necessary  concept  in  this  section  is  that  of  an 
"untinely  disruption."  Anticipated  node  v*  v;ill  be  said  to 


experience  an  untinely  disruption  if  either  3cr(v*;  cr  _:T7'v*) 


undergoes 


vaiue . 


let  us  recall  that  IT,,  is  ’used  in  this  rarer  to 

V  ~ 

number  of  v's  descendants  and  C  to  denote  a  sequence 
and  deletion  ccnnands  whose  length  is  denoted  as  |  C  j 


‘CV’  ~‘CV 


md  7 _ vri.ll  respectively  der.ore  she 


and  final  values  of  II  during  C. 

v  ° 


henna  4.1 


Let  v  denote  an  interior  node  in  tree 


an  anticipated  node  belonging  to  A!TTICI?(v,  E),  and 
seouence  cf  connands  that  insert  and  delete  records 


subtree  of  2  which,  is  rooted  at  v.  Suppose  that  !  3  i 
of  the  GCAS  procedure  are  made  with  the  arguments  of 
during  the  period,  cf  C's  execution.  SDSfv ■- }  will  be 


Willard 


full"  constructed  before  she  end  of  this  rerioa  if  experiences 


NO  UITTIYELY  DISYUYTICITS  durian  this  reriod  and  ar." 


"  one  cr  tue 


following  three  condi cions  does  hold : 

(3)  )  C  1  >  Icv  /  (J  -  1) 

14)  I  C  I  >YCV  /  (J  -  1) 

(5)  1  C  |  >?cv  /  (J  -  1) 

lema  4.1  requires  no  proof,  since  it  is  an  absolutely 
trivial  consequence  of  the  definitions  of  the  GCAS  procedure 
and  of  partially  constructed  SIS  fields,  host  of  the  rest  of 
this  section  v.ill  shcv;  hov.r  the  Alg  (cf  .  ^ .  II,  J;  procedure  attains 
its  efficiency  by  insuring  that  the  conditions  cf  this  lean a 
are  consistently  satisfied. 

Another  definition  that  7/ill  be  needed  in  cur  discussion 
is  that  cf  a  node  v's  being  "affected”  by  a  rotation.  This 
shall  be  defined  to  mean  that  one  of  the  following  two  concisions 
is  satisfied: 

i)  the  node  lies  at  the  root  of  the  subtree  which  is 
directly  ranipulated  by  this  rotation  (  An;  proto v- re 
example  is  v  in  Diagrams  1  and  2) 
ii)  the  node  is  physically  produced  by  the  rotation 
(vA,  v3  and  vc  in  Diagrams  1  and  2  are  examples) 


Derma  4.2: 


let  C  denote  a  secuerce  of  corr.ands 


:i ven  so 


insert  and  delete  records  in  the  subtree  chat  has  v  as  a  rcco. 


If  no  rotation  affects  v  durir.c  the  ran o< 


J'  4  <*'> --  V*  ^  v> 

->  V.  -  —  w.-  •  -  A  i. 
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the  p(v)  ratio  car.  change  by  r.o  more  than  C  /II  during  this 
tine  {Th.  3.6.2). 

The  proof  of  Lenna  4.2  has  been  onitted  fron  this  rarer 
because  it  is  trivial  (it  can  be  found  in  '.Vi7Sa),  Our  next 
topic  is  the  ROTATE id-  ,  k)  subroutine.  This  subroutine  was 
previously  used  by  the  Alg(oi  ,  k)  procedure  to  determine  whether 
a  single  as  opposed  to  a  double  rotation  should  be  applied  to 
rebalance  a  node  whose  p(v)  ratio  exceeds  the  {d  ,  1  ~d)  range. 
In  sinilar  manner,  it  will  be  used  by  Algid  .  Q ,  II,  J)  .  Only 
the  tiring  of  rotations  will  differ  in  A Igid-,  h)  and 
Algicd  ,  fi,  II,  J).  Instead  of  autonatically  executing  a 
rotation  as  soon  as  p(v)  exceeds  (<=*■,  1  -c* ) ,  Algid  ,  Q,  R,  -J ) 


will  utilize  a  conplicated  timing  nechanisn  which  invokes 
ROTATE  id  ,  k)  at  sone  tine  when  p(v)  has  exceeded  (e/,  1  -oO 
but  still  lies  within  a  broader  (  £,  1  -&)  range.  The 


following  preliminary  lenna  will  play  a  major  role  in  err 
later,  more  detailed  discussion  of  Alg(  d-  ,  B,  Z,  J). 


Lenina  4.3?  There  exist  coefficients  A,  d  ,  ^  and  k 

satisfying 

&  >  at.  >  8 

such  that 

i)  for  every  tree  ail  of  whose  nodes  have  p-ratios 
belonging  to  the  ( 3 ,  1  -  3  )  range 
ii)  for  every  node  v  within  this  tree  whose  p(v)  ratio 
exceeds  the  (oi  ,  2.  -  d  )  interval 
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the  application  of  the  ROTATE  (c?4,  1:)  subroutine  to  the  subtree 
rooted  at  v  will  cause  all  "he  affected  nodes  to  obtain  p-ratios 
belonging  to  the  (A,  1  -A)  interval  (Th  3- 6. A). 


One  way  to  verify  the  preceding  lemma  is  to  shew  the  a  ia  is 
a  consequence  of  more  or  less  straightforward  algebra.  Although 
each  step  of  this  derivation  is  sirple,  the  total  computation  is 
made  complicated  by  the  fact  that  it  requires  the  verification  of 
approximately  three  dozen  equations.  Such  analysis  is  clearly  toe 
lengthy  for  one's  intuition,  and  there  does  exist  a  simpler  proof 
for  those  who  understand  point-set  typology . 

The  latter  preof  has  two  parts.  It  begins  with  the  observa¬ 
tion  that  Lemma  4.2  can  be  algebraically  verified  if  the  lemma 
is  changed  to  read:  p(vj  =  c*  =©.  Subsequently ,  the  proof  uses 
typological  arguments  about  continuous  functions  to  show  that 
Lemma  4.3  is  a  consequence  of  this  observation. 

In  the  interests  of  brevity,  the  details  of  this  latter 
proof  will  r.ot  be  given  here.  Readers  interested  in  this 
subject  should  consult  pp.  138-144  of  V/i7 8a. 

In  our  discussion  of  Alg(c*,  /3,  Z,  J),  it  will  be  assumed 
that  the  d  ,  Pf  v  and  A  parameters  have  been  chosen  to  satisfy 
Lemma  4.3  ?-r.d  that  J  has  been  chosen  to  satisfy 
(6)  J  >  3  +  7 


(7) 


J  >  3 


£*(A-c <) 

_ 

ok  -  3 
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Cur  formal  description  of  AlgC^t  ,  0,  II,  J)  begins  in  the  r.e:ct 
paragraph.  This  procedure  will  be  carefully  designed  to  ensure 
that  all  nodes  have  p-ratios  consistently  belonging  to  tne  (  <3  .  1 
range.  It  will  utilize  the  GCAS  subroutine  for  constructing 
anticipated  SDS  fields  in  the  evolutionary  manner  previously 
described.  If  a  node's  p(v)  ratio  exceeds  ( cA  ,  1  -ck)  but 
lies  within  ( /3  ,  1  -0),  then  Alg(  c*  ,  B> ,  E,  J)  will  have  the 
ROTATE ( ct  ,  k)  subroutine  apply  a  rebalancing  rotation  E  at  the 
first  moment  in  tine  when  the  SDS  fields  of  AI7TICI?  (v,  R)  have 
reached  fully  constructed  states.  Should  full  construction  not 
be  reached  before  the  tine  when  p(v)  exceeds  the  (0,  1  -  &  ) 
interval,  then  Alg( cA  ,  0,  Z,  J}  will  invoke  an  inefficient 
construction  procedure  that  requires  0(wII„,log  IT  )  run  tine  to 

V  V 

reorganize  the  v-rooted  subtree.  The  efficiency  of  Alg(cf,  0.  I.', 
will  be  shewn  to  result  fron  the  fact  that  a  proper  choice  of 
parameters  ensures  that  the  inefficient  procedure  described  in 
the  previous  sentence  will  be  executed  only  7/hen  is  bounded 
above  by  sene  snail  constant  of  I.I.  This  paragraph  is  intended 
only  to  provide  a  brief  sketch  of  the  Alg(  II,  J)  procedure. 

A  nuch  more  detailed  discussion  now  follows. 

FCRIIAI  DESCRirTlCIT  OF  AIG  ( oC .  3 ,  II,  1):  The  procedure  v.all 
take  as  arguments  an  augmented  tree  EuIlCi  8.  C CHIU. 871  CL  to  ©1  "til SI* 
insert  or  delete  a  record  y  into  T.  It  will  be  assured  that 
the  four  parameters  of  Alg(  C*  ,  S,  II,  J)  satisfy  lemma  4.3  and 
equations  5  and  7.  Also,  it  will  be  presumed  that  there  is  a 
one-to-one  correstonder.ee  between  the  leaves  of  T  and  the  records 
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of  the  list  it  is  representing  (as  opposed  to  a  pairing  between 
general  nodes  and  records).  Upon  the  user's  insertion  or  deleeicr. 
cacnand.  the  following  procedure  will  be  enecuted: 

1)  First,  insert  or  delete  those  nodes  suggested  bp  the 
user's  c onnand .  Llore  specifically,  this  means  that: 

la)  If  the  user  gave  a  deletion  c onnand .  then  both 
record  y  and  its  father  should  be  deleted  (the 
latter  because  an  interior  node  has  nc  purpose 

v/hen  it  has  only  one  son).  Along  with  these  changes, 
the  relevant  pointer  in  y's  grandfather  is 
adjusted  so  that  it  contains  the  address  of  y's 
brother  rather  than  father. 

lb)  If  the  user  gave  an  insertion  command,  then  perform 
the  approximate  inverse  of  the  above  operatic:-.. 

2)  IText .  update  all  necessary  SDS  fields  to  reflect  she 
preceding  insertion  or  deletion  of  y.  litis  means  that 

2a)  All  (current)  ancestors  of  y  should  have  their 
SDS  fields  updated. 

2b)  Also,  for  every  such  ancestor  v,  a  cheer  should  bs 
made  to  determine  whether  Alg( d.  ,  8 ,  II.  J)  has 
raised  a  flag  indicating  the  anticipation  of  a 
rotation  on  the  subtree  rooted  at  this  node.  If  so. 


then  the  anticipated  SDS  fields  of  AD!  I  Cl?  (v .  ?.) 
should  be  updated  when  appropriate. 

3)  The  last  portion  of  Alg ( o(  .  8 ,  IZ ,  J)  will  be  designed  to 

ensure  that  the  p-ratios  of  all  nodes  belong  sc  the  {B .  1-6' 
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interval.  This  step  is  the  only  portion  of  A1  g(oi  ,^3,  II,  -X) 
v/hich  requires  subtlety  to  attain  efficiency.  The  following 
six  substeps  vd.ll  be  invoked  once  for  each  (current) 
ancestor  v  of  y  in  bottom -up  order: 

3a)  If  p(v)i^cf,  then  raise  a  flag  indicating  the 
anticipation  of  a  leftward  rotation  through  node 
v  (if  this  flag  v;as  not  previously  raised). 

Similarly,  lower  this  flag  if  p(v)  >c<. 

3b)  In  like  manner,  raise  a  right-rotation  antic-gaticn 
flag  if  p(v)  >  1  -o< ,  and  lower  it  otherwise. 

3c)  Let  R^  and  R ^  deno  '■j  6  e  single  and  double  left 
rotations  illustrated  in  Diagrams  1  and  2.  If 
the  left-rotation  flag  is  raised  over  v,  then 
make  one  subroutine-call  to  GCAS  for  each 
anticipated  node  in  A2TTICI?  (v,  H-,  ;  and  ADTI0IT(v,  ?.p ) . 
with  the  specification  that  their  3DS  fields  should 
each  be  enlarged  by  J  elements.  (A  total  of  three 
anticipated  nodes  are  affected  by  these  su'crc.:. tine- 
calls:  in  the  notation  of  Diagrams  1  and  2.  they 
are  VA,  V3  and  Vc. ) 

3d)  If  the  right-rotation  flag  has  been  raised,  then 
make  the  similar  subroutine-calls  to  GCAS  for  it, 

3e)  Let  ?.  denote  that  rotation  which  the  ROTATE  (oC,  1; 
subroutine  indicated  would  rebalance  node  v.  If 
p(v)  has  a  value  lying  outside  the  (  d  .  1  -c(  ) 
interval  but  still  within  the  ( B ,  1  -B)  range. 
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and  if  all  the  SDS  fields  of  AI72ICI?(v,  ?.)  have  reached 
fully  constructed  states,  then  execute  this  rotation. 
3f)  If  p(v)  attains  a  value  ouuside  the  (£>,  1  -B  ) 

interval,  then  invoke  the  trivial  inefficient  procedure 


that  requires  wIT^log  IT  runtine  to  ensure  that  all 


members  of  the  v-rooted  subtree  have  p-ratios  belonging 
to  the  (1/3,  2/3)  interval. 

Comment :  Cne  of  the  there s  of  the  remainder  of  this  section 
will  be  that  step  3f's  inefficiency  is  unimportant  because  a 
later  theorem  will  show  that  it  is  executed  only  when  IT.,  is 
less  than  some  small  constant  of  II.  Before  delving  into  this 
topic,  we  introduce  some  preliminary  lemmas. 


lemma  4.4:  .4  tree  manipulated  by  the  A IgidL  ,  B ,  Zf  J) 

procedure  must  have  a  height  of  C(log  IT)  magnitude  (2h  3.6.3 
and  3 • 6 . C ) . 


Proof: 


In  view  of  Lemma  4.3,  it  is  apparent  that  steps  3= 


and  3f  of  Alg($t,  &,  IC ,  J)  ensure  that  the  p-ratios  of  ail  nodes 
in  T  belong  to  the  {P.  1  -  &  )  interval.  IT?.-? 3  has  indicated 
that  such  ratios  ensure  an  O(log  IT)  height. 


QED 


Lemma  4 . 5 :  A  single  invocation  of  Alg(o(  , P.  II,  J)  will  r.evo: 
cause  steps  1,  2.  Itx  through  2e  to  collectively  consume  more  the: 
C(v;log  IT)  worsu-case  runtime  (ih  3.4.1,  2.5.3  ar.d  3.5.?). 


7/illar; 


Proof:  Given  a  tree  of  height  h,  it  is  easy  to  see  that 

1)  Step  1  will  consume  no  more  than  C(h)  runtime  (to 
locate  that  record  which  should  be  inserted  or  dele  zee 

2)  Step  2  will  likewise  require  nc  more  than  0(hv;'  nmti: 

3)  A  single  invocation  of  Alg(ot  ,  Z,  J)  -.Till  cause  ste 
3a  through  3e  to  be  iterated  no  more  than  h  times  witi 
each  execution  consuming  no  more  than  cw  runtime  for 
some  coefficient  c  that  is  a  function  of  <£,  Q,  II  and  J 

The  preceding  observations  imply  that  steps  1  through  2e  will 
collectively  consume  no  more  than  O(wh)  runtime.  In  view  of 
Lemma  4.4's  derivation  of  h,  this  establishes  our  C( wlog  17} 
worst-case  runtime .  QZD 


The  final  goal  of  this  section  will  be  to  prove  that  step 
also  executes  efficiently.  That  result,  in  conjunction  with 
Lemma  4.5-  will  show  that  Alg( c<  ,  Z,  «7)  has  an  C(v.log  17} 
worst-case  runtime.  Our  discussion  of  this  topic  must  be 
prefaced  with  f our  preliminary  lemmas. 


Lemn.a  4.6:  Let  C  denote  a  sequence  of  insertion  and  delcai 

commands  that  are  applied  to  the  portion  of  tree  2  the a  is  a 
descendant  of  root  v.  Suppose  that  either  p(v)^e<  or  p(v }  I 


during  this  sequence.  If  the  length  of 

(8)  IcU  fe*(A-<*)  ::CT1 


rr 

w' 


c o  am* 

bl-do 


Proof: 
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It  is  sufficient  to  prove  tie  lenta  only  for  the  case 
where  p(v)<ef  interval  (since  the  other  case  will  follov/  iron 
symmetry  arguments ) .  The  proof  for  this  case  v/ill  be  based  on 
the  following  three  observations: 

1)  Let  vR  denote  v’s  right  son  and  v^  denote  the  left  scn 
of  vR  (as  was  previously  shown  in  rotation  diagrams 

1  and  2).  It  is  easy  to  see  that  untimely  disruptions 
in  this  case  are  possible  only  if  steps  }e  or  3 f  apply 
a  rotation  to  the  subtrees  rooted  in  either  v~  or  v- . 

2)  Let  s  denote  the  node  of  v0  or  v-,.  An  enarination  of 
Lenna  4.3 ,  together  with  steps  2e  end  3f  of  Alg(os5,  y.  } , 
reveals  that  such  rotations  will  be  made  only  when  p(s, 
lies  outside  the  (c^,  1  -c^  )  interval,  and  that  they 

will  cause  p(s )  to  move  inside  the  (A,  1  -A)  range. 


3 )  Lenna  4.2  and  equation  8  easily  imply  that  if  either 
?(vR)  or  p(vD)  lie  inside  the  (A,  1  -  A)  interval 
(at  the  end  of  any  single  command  in  sequence  0). 
then  this  value  will  remain  inside  the  (ot.  1  -  c<) 
range  for  the  rest  of  this  command  sequence. 

It  is  fairly  easy  to  show/  that  the  preceding  three  obser/ati 
imply  that  A2T!ICI?(v,  ?. )  v/ill  be  caused  to  have  no  more  than 
one  untimely  disruption  from  node  vR  and  no  more  than  one 
untimely  disruption  from  v^.  Thus  no  more  than  two  untimely 
disruptions  v/ill  occur.  CSD 


Lemma  4.7: 


Let  C  denote  a  sequence  of  insertion  or  delctior 


commands  that  are  annlied  to  a  subtree  v/ithin  7  v/hioh  is  roc  tec 


at  v.  If 


I  c  | 


5  fel 


then  it  is  impossible  for  the  value  of  n(v)  to  remain  ir.  either 


in  v/hioh  secnence  C  is  executed  (Th  3.6.G). 


Prcc* : 


The  lemma.  will  be  verified  by  means  of  contradictior 


Cur  goal  will  thus  be  to  sho  V7  t  a  contradiction  will  arise 

if  it  is  assumed  that  p(v)  will  remain  in  either  the  or 

(1  -ot  ,  1  ~/3  )  interval  during  the  entire  execution  of  sequence 


Let  C*  denote  the  seque 


nee  of  the  last  ^ ...... ^Y-Tj 


appear  in  sequence  C.  The  desired  contradiction  will  be  obtain 
if  we  examine  this  sequence. 

The  significant  characteristic  of  sequence  C*  is  that  eque 
S  indies  that  its  length  must  satisfy  the  ir.ecur.lit;-  of 


(10)  |c*|<  fe'A-ot) 


The  above  equation,  together  with  the  hypothesis  of  Lemma  4.7 . 
imply  that  Lemma  4.6  is  applicable  to  sequence  C*.  That  lemma 
indicates  this  sequence  can  contain  no  more  than  two  untin.el: 
disruttions.  It  thus  follows  that  sequence  C*  will  contain  a 


subsecuer.ee  c 


.p  r?cv 

I  J  -  3 


consecutive  commands  that  contain  no 
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untimely  disruptions.  In  viev/  of  Lemma  4.1,  it  is  apparent  aha' 
AIITICIP(v,  R)  v.dll  become  fully  constructed  as  a  result  of  the 
repeated  invocations  of  GCAS  that  are  made  by  steps  3c  and  }d 
during  this  period.*  This  latter  fact  is  important  because 
step  3e  of  A1  K,  J)  is  designed  to  apply  rotation  ?. 

to  node  v  as  soon  as  the  A2-TTICI?(v,  R)  SDS  fields  reach  stages 
of  full  construction.  In  view  of  Lemma  4.3,  this  rotation  will 
move  p(v)  into  the  (A,  1  -  A)  interval,  which  is  a  subset  of 
(c*  ,  1  -cM.  Hence,  the  desired  contradiction  has  been  reached , 
since  it  was  impossible  to  keep  p(v)  outside  (c*,  1  -ts). 

.  QRL 


Lemma  4.8: 


;ep  31“  of  Alg  (<>.,  &,  II,  J)  applies  a  rctati: 


to  node  v,  then  at  the  time  of  this  rotation  u  must  satisfy  the 
following  equation  (Th  3 .  6.H) : 


(11) 


3Hy  + 


3~i 


>  f(d-0)  :h]  -  l 


i 


J  -  3  l 


Proof :  ITots  that  Alg(c^,  ft,  II,  J)  will  apply  step 


jr  to  no ce 

v  only  if  this  node's  p(v)  value  lies  outside  the  (3,  1-6) 


range.  A Iso,  note  that  Lemma  4.2  implies  that  the  period  be twee 
the  last  time  v/hen  p(v)  retained  a  value  inside  the  (ct  ,  1  -  C<) 
interval  and  the  moment  v/hen  it  moves  outside  the  ( (3 .  1  -S) 


*  Some  especially  attentive  readers  may  /.rich  to  understand  v;Vi*.* 
slightly  different  denominators  of  J’-l  and  J-3  appeared  in  Lemma 
4.1  ar.d  the  present  lemma.  The  answer  is  that  the  lemmas  di 
different  sequences  of  command  with  the  4.1  sequence  contain 
within  the  present  sequence .  Consequently  Lev  has  slightly  cliff 
ent  meanings  in  these  two  lemmas,  and  the  denominator  must  be 
adjusted  to  make  Lemma  4.1  applicable  to  Lemma  a. 7. 


i  )  r> 
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interval  must  include  a  minimum  of  p(  d  -  £  )  IT^"|  insertion  and 
deletion  commands  which  are  applied  zo  v ' s  sea  of  leaf  descendants. 
This  fact,  in  other  words,  means  that  there  nust  he  a  minimum.  of 
ITV""|  ~  1  conn  ends  that  are  applied  to  v's  subtree 
during  the  interin  period  when  p(v )  lies  in  either  the  ( £  . 1 
or  (1  ,  1  )  intervals .  The  proof  of  the  present  theorem 

can  be  completed  if  we  observe  that  Lemma  4.7  implies  that  this 
same  set  of  insertion  and  deletion  commands  is  forbidden  to 


contain  more  than 


r  3EV  -  t  . 

I  -  instructions. 

1  J  -  3 


The  observations 


J  -  3 

in  the  preceding  two  sentences  imply  that  equation  11  must  hold. 


rycTi 


lemma  4.9:  Each  invocation  of  saep  3f  of  A1  g(o<  ,B.  IC ,  J) 

vh.ll  consume  no  more  than  C(w)  runtime. 


Proof : 


Equation  7  implies  that  equation  11  can  be 


satisfied  only  for  integers  7.f  that  are  less  ahan  some  fined 
constant  of  T.  In  view  of  Lemma  4.8,  this  bound  implies  ahrh 
step  3f  is  applied  only  to  nodes  that  have  less  than  II 
descendants.  Hence  step  3f  must  consume  less  than  illog  Hw 
runtime  for  some  fixed  constant  Id.  QEE 


»/>**>  wv  •• 
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Proof: 


An  immediate  consequence  of  Lemmas  4.4,  4. 


Renark  4.1: 


An  augmented  tree  will  be  said  to  satisfy  the 


separability  condition  if  all  pointers  within  each  3BS  field 
necessarily  contain  the  addresses  of  other  data  items  in  ties 
sane  SPS  field.  All  the  augmented  trees  in  3S77 ,  Lu7S,  LV.'Tob, 
V/i78a,  and  part  2  of  \'J±7 8c  were  separable,  and  the  super-1 -tree 
theorem  thus  guarantees  their  C(wlog  IT)  worst-case  update  tine. 
Technically,  the  theorem  does  not  apply  to  inseparable  augmented 
trees,  such  as  the  P-(2)  pyramid  of  Y.;i73a  or  she  P.  (2)  sy ranid 
of  ;7i78d  (which  lock  together  the  3DS  fields  of  father  and  son 
nodes  by  having  records  in  one  field  point  to  their  counterparts 
in  the  other),  nevertheless,  it  is  absolutely  trivial  to  modify 
the  super-1- tree  algorithm,  so  that  it  also  applies  to  these 
latter  types  of  pyramids.  A  detailed  discussion  of  this  revisicr 
has  been  omitted  from  this  paper  because  it  raises  no  new 
interesting  theoretical  issues. 


Remark  4.2:  A  detailed  examination  of  Alg(c£  ,  II,  f ) 

indicates  that  it  has  an  efficient  CZRT  runtime  coefficient 
but  an  inefficient  worst-case  coefficient.  Both  coefficients 
can  be  markedly  improved  with  the  use  of  several  additional 
modules.  One  especially  attractive  nodule  is  a  procedure  that 
treats  step  3  of  Alg  (Cf  ,  0,  II,  J)  as  a  background  process 
whose  node  rebalancing  operations  are  typically  deferral  to 


I 
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tines  when  the  computer  would  otherwise  be  idle,  risers  are  al 
many  other  important  coefficient  optimisation  techniques.  the 
v/ere  not  discussed  here  because  they  were  deemed  unsuitable  f  c 
an  introductory  article. 


Remark  4- .  3  : 


There  is  an  interesting  modified  version  of  th 


super-3-tree  algorithm  that  nay  be  useful  in  certain  applicati 
This  variation  will  be  called  the  multipath  super-3 -tree -  It 
assign  each  interior  node  a  multiple  number  of  children,  rathe 
than  two.  The  IT  value  of  a  node  v  at  a  death  d  in  such  a  t~s 
with  IT  descendants  will  be  recuired  to  sati  sf;* 


c*  _1  h"d  it  <  i:  <  ::‘c 


'or  seme  aarrorriately  chosen  constants  of  h 


ol 


defined  node  splitting  and  merging  rules  will  enable  the  multi 
algorithm  to  have  C(wlog  IT)  worst-case  or  C3RT  insertion  an 
deletion  runtime  (  the  same  is  not  true  when  the  multiway 


algorithm  of  11:73  it  applied  to  augmented  trees). 


—  ■-  w-  J. 


between  multipath  and  binary  super-3-trees  reveal  that  the  for 
has  an  improved  memory  space,  insertion  and  deletion  ecefficie 
while  the  latter  shows  improvement  in  the  retrieval  time  ocefi 
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